Slovenian Text-to-Speech Synthesis for Speech User Interfaces

Authors

  • Jerneja Zganec-Gros
  • Ales Mihelic
  • Nikola Pavesic
  • Mario Zganec
  • Stanislav Gruden
Abstract

The paper presents the design concept of a unit-selection text-to-speech synthesis system for the Slovenian language. Owing to its modular and upgradable architecture, the system can be used in a variety of speech user interface applications, ranging from carrier-grade server voice portals and desktop user interfaces to specialized embedded devices. Because memory and processing power requirements are important factors for a possible implementation in embedded devices, lexica and speech corpora need to be reduced. We describe a simple and efficient implementation of a greedy subset selection algorithm that extracts a compact subset of high-coverage text sentences. An experiment on a reference text corpus showed that the subset selection algorithm produced a compact sentence subset with little redundancy. The adequacy of the spoken output was evaluated with several subjective tests recommended by the International Telecommunication Union (ITU).

Keywords: text-to-speech synthesis, prosody modeling, speech user interface.
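The abstract does not give the details of the greedy subset selection algorithm, so the following is only a minimal sketch of the general technique (greedy set cover), not the authors' implementation. It assumes that coverage is measured over character bigrams as a crude stand-in for phonetic units such as diphones; the function names `units` and `greedy_select` are hypothetical.

```python
from typing import List, Set


def units(sentence: str, n: int = 2) -> Set[str]:
    """Return the set of character n-grams in a sentence.

    Assumption: character n-grams stand in for the phonetic units
    (e.g. diphones) that a real corpus design would count.
    """
    # Keep letters and spaces only, lowercase, collapse whitespace.
    text = "".join(ch for ch in sentence.lower() if ch.isalpha() or ch == " ")
    text = " ".join(text.split())
    return {text[i:i + n] for i in range(len(text) - n + 1)}


def greedy_select(sentences: List[str], n: int = 2) -> List[int]:
    """Greedily pick sentence indices until every unit in the corpus is covered.

    At each step the sentence that adds the most not-yet-covered units
    is chosen; ties go to the earliest sentence.
    """
    unit_sets = [units(s, n) for s in sentences]
    uncovered: Set[str] = set().union(*unit_sets)
    chosen: List[int] = []
    while uncovered:
        best = max(range(len(sentences)),
                   key=lambda i: len(unit_sets[i] & uncovered))
        gain = unit_sets[best] & uncovered
        if not gain:  # defensive: nothing left that any sentence can add
            break
        chosen.append(best)
        uncovered -= gain
    return chosen


if __name__ == "__main__":
    corpus = ["abc", "bcd", "abcd", "xy"]
    picked = greedy_select(corpus)
    print([corpus[i] for i in picked])
```

On the toy corpus above, the sentence "abcd" is picked first because it covers three bigrams at once, which makes "abc" and "bcd" redundant. A production version would typically weight units by frequency and penalise long sentences, but those choices are not stated in the abstract.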


Similar resources

Study on Unit-Selection and Statistical Parametric Speech Synthesis Techniques

One interesting topic in the multimedia domain is enabling computers to produce speech. Speech synthesis grants the computer the human ability to produce speech. The data-based approach and the process-based approach are the two main approaches to speech synthesis. Each approach has its own challenges. Unit-selection speech synthesis and statistical parametr...


Design of Optimal Slovenian Speech Corpus for Use in the Concatenative Speech Synthesis System

This paper presents the development of a Slovenian speech corpus for use in the concatenative speech synthesis system being developed at the University of Maribor, Slovenia. The emphasis is on maximising the usefulness of the defined speech corpus for concatenation purposes. The usefulness of the speech corpus depends heavily on the corresponding text and can be increased...


Syllable and Segment Duration at Different Speaking Rates in the Slovenian Language

Speech timing at different speaking rates was studied for the Slovenian language, and the results were applied in the two-level duration prediction model of the Slovenian text-to-speech system S5 [1]. To enable the synthesiser to pronounce input text at several speaking rates, tests were made to study the impact of speaking rate on syllable duration and duration ...



The New Slovenian Text-to-Speech System

Human-computer interaction in a natural language is becoming possible due to rapid development of computer power. While text-to-speech (TTS) systems for major world languages are quite advanced, smaller languages, like our Slovenian language, lack quality TTS synthesis. At the "Jozef Stefan" Institute a system called GOVOREC (SPEAKER) has been developed which is capable of automatic conversion ...



Publication date: 2005